modern lstm #2752

wangjiawen2013 · 2025-01-28T04:06:47Z

Pull Request Template

Checklist

Confirmed that run-checks all script has been executed.
Made sure the book is up to date with changes in this PR.

Related Issues/PRs

Provide links to relevant issues and dependent PRs.

Changes

Summarize the problem being addressed and your solution.

Testing

Describe how these changes have been tested.

codecov · 2025-01-28T04:30:09Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 83.62%. Comparing base (9f00320) to head (af75564).
Report is 1 commits behind head on main.

Additional details and impacted files

@@           Coverage Diff           @@
##             main    #2752   +/-   ##
=======================================
  Coverage   83.62%   83.62%           
=======================================
  Files         825      825           
  Lines      108686   108686           
=======================================
  Hits        90893    90893           
  Misses      17793    17793


wangjiawen2013 · 2025-01-28T08:18:46Z

Here is a test result. The results may differ slightly from run to run because the datasets are randomly generated.

This implementation is complementary to Burn's official LSTM; users can choose either one depending on the project's specific needs.

laggui

Great example! 🙂 I have some minor changes in mind.

We should also add it to the list of examples in the README and book.

The example could be restructured to follow the workspace examples (similar to what I did in your last wgan example), but that's not mandatory.

examples/modern-lstm/README.md

examples/modern-lstm/src/main.rs

examples/modern-lstm/src/model.rs

examples/modern-lstm/Cargo.toml

wangjiawen2013 · 2025-01-30T02:08:52Z

Good! I am not as familiar with Rust and Burn as you are. I am glad that you can revise the code. I'll learn from you about the design logic and formatting.

Co-authored-by: Guillaume Lagrange <[email protected]>

wangjiawen2013 · 2025-01-30T05:32:22Z

Good! I am not as familiar with Rust and Burn as you are. I am glad that you can revise the code. I'll learn from you about the design logic and formatting.
I am not clear about how GitHub works. I see that you have modified the code in my pull request. Do I need to modify the source code and open a pull request again, or will you make all the changes you mentioned yourself? If I need to make the changes you suggest, I will modify and test the code in my fork and open a pull request again.

laggui · 2025-01-30T13:26:56Z

I am not clear about how GitHub works. I see that you have modified the code in my pull request.

The specific changes for the Cargo.toml were pretty short, so I was able to suggest the code changes from the GitHub PR review, and as you can see it automatically committed them to your fork when you applied them.

Do I need to modify the source code and open a pull request again, or will you make all the changes you mentioned yourself? If I need to make the changes you suggest, I will modify and test the code in my fork and open a pull request again.

The contribution process is usually iterative. A user (you) submits code changes, a contributor (me) reviews the code and requests modifications where needed, and the user can respond to the requests and make the necessary changes before asking for another round of review. The process is typically repeated, but everything remains part of the same pull request 🙂

In your last PR, the only changes required were structural (changing how the example was exposed in the workspace), so I made the changes myself since I had time.

I'll let you make the changes, but if you need help let me know!

wangjiawen2013 · 2025-01-30T17:24:40Z

It's weird that the model performance decreased a little after these changes. Is the latest Burn still unstable, or do the crates in Cargo.toml affect the performance? I didn't change any parameters or the model. The only things that changed are Cargo.toml and the shell commands. I tested the original one again, and it indeed worked better than the new one.
This is the new one:

I think I found the reason. I copied the code from wgan-generate.rs (https://github.com/tracel-ai/burn/blob/main/examples/wgan/examples/wgan-generate.rs), where the autodiff backend was used. But for modern-lstm inference, I think the backend without autodiff must be used; otherwise the predictions will be erroneous because of the layernorm and dropout. Now it works as expected when using the backend without autodiff for inference.
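
The point above can be illustrated with a minimal sketch in plain Rust, assuming dropout is gated on whether the backend tracks gradients. All names here (`Backend`, `Plain`, `Autodiff`, `Lcg`, `dropout`, `predict`) are hypothetical stand-ins invented for illustration; they are not Burn's API and not the code from this PR.

```rust
// Hypothetical stand-ins for illustration only; NOT Burn's actual types or traits.
use std::marker::PhantomData;

/// A backend tells layers whether gradient tracking (training mode) is on.
trait Backend {
    const AD_ENABLED: bool;
}

/// Plain backend: no gradient tracking, intended for inference.
struct Plain;
impl Backend for Plain {
    const AD_ENABLED: bool = false;
}

/// Wrapper that adds gradient tracking on top of another backend.
struct Autodiff<B: Backend>(PhantomData<B>);
impl<B: Backend> Backend for Autodiff<B> {
    const AD_ENABLED: bool = true;
}

/// Tiny linear congruential generator so the sketch needs no external crates.
struct Lcg(u64);
impl Lcg {
    fn next_f64(&mut self) -> f64 {
        self.0 = self
            .0
            .wrapping_mul(6364136223846793005)
            .wrapping_add(1442695040888963407);
        // Use the top 53 bits to build a value in [0, 1).
        (self.0 >> 11) as f64 / (1u64 << 53) as f64
    }
}

/// Inverted dropout: only active when the backend tracks gradients.
fn dropout<B: Backend>(input: &[f64], prob: f64, rng: &mut Lcg) -> Vec<f64> {
    if !B::AD_ENABLED || prob == 0.0 {
        return input.to_vec(); // inference: identity, output is deterministic
    }
    input
        .iter()
        .map(|&x| if rng.next_f64() < prob { 0.0 } else { x / (1.0 - prob) })
        .collect()
}

/// Toy "model": dropout with p = 0.5 followed by a sum.
fn predict<B: Backend>(features: &[f64], rng: &mut Lcg) -> f64 {
    dropout::<B>(features, 0.5, rng).iter().sum()
}
```

With these stand-ins, `predict::<Plain>(&[1.0, 1.0, 1.0], &mut Lcg(42))` always sums to 3.0, whereas `predict::<Autodiff<Plain>>` on the same input returns 0, 2, 4 or 6 depending on the random mask, so it can never match the inference result. That is the kind of prediction noise that goes away once inference runs on the backend without autodiff.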

laggui · 2025-01-31T13:09:38Z

I think I found the reason. I copied the code from wgan-generate.rs (https://github.com/tracel-ai/burn/blob/main/examples/wgan/examples/wgan-generate.rs), where the autodiff backend was used.

Oh good catch! The structure was likely carried over from another example, but I just fixed it in #2736.

I'll fix the wgan example.

laggui

Fixed some conflicts with the main branch.

Thanks for the great example 🙂 we might add this LSTM implementation to our actual modules in the future

modern lstm

f0cddd5

wangjiawen2013 added 5 commits January 28, 2025 12:45

format

31477f2

formatting

2872169

formatting

a32e01f

formatting

1cf29c1

formatting

1717f71

fix a typo

dbf7639

laggui self-requested a review January 28, 2025 18:08

laggui requested changes Jan 29, 2025

Update examples/modern-lstm/Cargo.toml

64f10ab

Co-authored-by: Guillaume Lagrange <[email protected]>

use generic backend

2690895

wangjiawen2013 added 3 commits January 31, 2025 10:35

remove Cargo.lock

b27b228

use backend for inference

d555318

update readme

68a0f93

laggui added 2 commits February 3, 2025 10:18

Merge remote-tracking branch 'upstream/main'

7ecc9c4

Update README + fix main changes

9bdc8e9

laggui approved these changes Feb 3, 2025

Fix clippy

af75564

laggui merged commit e2fa935 into tracel-ai:main Feb 3, 2025
9 checks passed
